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IMAGEA^OICE COMMUNICATION TERMINAL DEVICES 
CLAIM(S) . 

1) An image/voice communication terminal device equipped with a 
direction-changeable image display unit, and comprising: a voice sample 
collecting means for collecting the voice samples at multiple positions 
around the image/voice communication terminal device; a target direction- 
determining means for determining the target direction by comparing the 
sound volumes of voice samples collected by said voice sample collection , 
means; a display direction-changing means for changing the display 
direction of said display means to the target direction determined by said 
target direction-determining means. 

2) An image/voice communication terminal device equipped with a 
photographing direction-changeable video camera, and comprising: a voice 
sample collecting means for collecting the voice samples at multiple 
positions around the image/voice communication terminal device; a target 


direction-determining means for determining the target direction by 
comparing the sound volumes of voice samples collected by said voice 
sample collection means; a photographing direction-changing means for 
changing the photographing direction of said video camera to the target 
direction determined by said target direction-determining means. 
3) An image/voice communication terminal device equipped with an image 
display direction-changeable means and a photographing direction- 
changeable video camera, and comprising: a voice sample collecting means 
for collecting the voice samples at multiple positions around the 
image/voice communication terminal device; a target direction-determining 
means for determining the target direction by comparing the sound volumes 
of voice samples collected by said voice sample collection means; a 
direction changing means for changing the display direction of said display 
means and the photographing direction of said video camera to the target 
direction determined by said target direction-determining means. 
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DETAILED DESCRIPTION OF THE INVENTION 
(0001) 

(Field of Industrial Application) 

The present invention pertains to an image/voice terminal device. 
(0002) 
(Prior Art) 

Due to development of a communication network such as that 
represented by an ISDN, a quantity of data that can be communicated per a 
time unit is increasing and terminals that conduct communications by using 
animate images and voices are emerging. 
(0003) 

The terminal of this type is generally equipped with a display unit for 
displaying the reception image data, such as a CRT and a LCD, and is 
generally equipped with a video camera as a means of collecting image data 
for transmitting. 
(0004) 

(Problems of the Prior Art to Be Addressed) 

The aforementioned display unit, however, has a limited visible 
range, particularly, a limited visible angle, so the user has to conduct the 
communications within the visible range of display unit. If the user moves 
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away during the communication, the display unit goes out of his/her sight, 

which is a problem. 

(0005) 

When the video is fixed in position, the user has to move to the 
photographing range of the video camera. To solve this problem, it was 
considered to make the video camera-mounting unit movable. In such a 
case, however, it will be tedious if the user has to change the direction. 
(0006) 

In addition, because of these problems, the user has to be too 
conscious of the terminal, which makes the communication environment 
uncomfortable and unnatural. 
(0007) 

The present invention, to solve the aforementioned problems, 
attempts to present an image/voice communication terminal device that the 
user can use naturally without being too conscious of the visible range of a 
display means and the photographable range of a video camera. 
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(0008) 

(Means to Solve the Problems) 

To accomplish the aforementioned objective, the present invention 
structures the image/voice communication terminal device as described in 
the following 1), 2), 3). 
(0009) 

1) The image/voice communication terminal device equipped with the 
direction-changeable image display unit comprises: a voice sample 
collecting means for collecting the voice samples at multiple positions 
around the image/voice communication terminal device; a target direction- 
determining means for determining the target direction by comparing the 
sound volumes of the voice samples collected by said voice sample 
collection means; a display direction-changing means for changing the 
display direction of said display means to the target direction determined by 
said target direction-determining means. 
(0010) 

2) The image/voice communication terminal device equipped with the 
photographing direction-changeable video camera comprises: a voice 
sample collecting means for collecting the voice samples at multiple 
positions around the image/voice communication terminal device; a target 
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direction-determining means for determining the target direction by 
comparing the sound volumes of the voice samples collected by said voice 
sample collection means; a photographing direction-changing means for 
changing the photographing direction of said video camera to the target 
direction determined by said target direction-determining means. 
(0011) 

3) The image/voice communication terminal device equipped with the . 
display direction-changeable image display means and with the 
photographing direction-changeable video camera comprises: a voice 
sample collecting means for collecting the voice samples at multiple 
positions around the image/voice communication terminal device; a . target 
direction-determining means for determining the target direction by 
comparing the sound volumes of the voice samples collected by said voice 
sample collection means; a direction changing means for changing the 
display direction of said display means and the photographing direction of 
said video camera to the target direction determined by said target direction- 
determining means. 
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(0012) 
(Operation) 

According to the aforementioned structure (1), the direction of the 
user is determined based on the sound volume of the voice samples 
collected at multiple positions around the terminal, and the display direction 
of the display means is directed toward said direction. According to the 
structure (2), the direction of the user is determined based on the sound 
volumes of the voice samples collected at multiple positions around the 
terminal. The photographing direction of the video camera is directed to the 
target direction. According to the structure (3), the display means and the 
video camera are directed to the user by the result of determination output 
in (1) and (2). 
(0013) 

(Embodiment Example) 

The embodiment example of the present invention is explained below 
in detail. 
(0014) 
(Example 1) 

Fig. 1 shows an oblique view of the contour of the "image/voice 
communication terminal device" as one embodiment example of the present 
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invention. In the figure, land 2 show the microphones, respectively, and 8 

the movable display unit. 

(0015) 

Fig. 2 shows a block diagram of the structure of the embodiment 
example. In the figure, 1 and 2 show the microphones, respectively, which 
are the voice input means, and 3 a speaker, which is a voice output means. 
(0016) 

In the figure, 4 indicates a voice interface section for performing a 
switching operation, by the command from the system controller 1 4, 
' between microphones 1 and 2 and speaker 3 as the voice input/output 
means, an echo canceling operation for deleting the echo when the 
microphones 1 and 2 and speaker 3 are used, a transfer operation for 
transferring the voice data collected from the microphones 1 and 2 as the 
voice sample collection means to the system controller 14, and an operation 
of generating a dial tone, call sound, busy tone, and arrival sound tone. 
(0017) 

In the figure, 5 indicates a voice encoding/decoding section for 
encoding the transmission voice signal and decoding the reception voice 
signal by the command from the controller 14, according to the voice signal- 
encoding/decoding algorism of 64 kbpsPCM A-law, 64 kbps PCM |Li-law, 
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64 bps/56 kbps/48 kbps SB-ADPCM, 32 kbps ADPCM, 16 kbps (for 

example, APC - AB), and 8 kbps. 

(0018) 

In the figure, 6 indicates a video camera, which is an image input 
means, for inputting a self-image, 7 an image/character camera, which is a 
image input means, for inputting pictures and graphs, and 8 a movable 
display unit for displaying the input image from the camera 6 or 
image/character camera 7, and the received image, and the operation screen. 
(0019) 

In the figure, 9 indicates the video interface section for performing, 
by the command from the system controller 14, an image input means- 
switching operation, a display switching operation of input image, received 
image, and of operation screen display, an image signal synthesizing 
operation for dividedly displaying them on the display unit 8, and a camera 
control operation for controlling the display direction of the display unit 8. 
(0020) 

In the figure, 10 indicates the video encoding/decoding section for 
encoding the transmission image signal and decoding the received image 
signal according to the CCITT proposal H.261, and 1 1 the data terminal for 
performing the data communication. And 12 indicates the data interface 
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section that provides multiplexing/separating section 15 of the transmission 
data from the data terminal 1 1 and system controller 14, and provides the 
received data to the data terminal 1 1 or system controller 14. 
(0021) 

In the figure, 1 3 indicates an operation unit, such as a keyboard or a 
touch panel, for inputting the control data for controlling the entire device; 
14 indicates the system controller which is equipped with a CPU, ROM, 
RAM, and with an auxiliary memory device, and which creates the 
operation/display screen according to the control status of the entire device 
by monitoring each section, executes the application program, commands 
the voice sample collection to the voice interface section 4, compares the 
sound volume data of the voice sample transmitted from the voice interface 
section 4, and teaches the target direction determined by the comparison 
result to the video interface section 9. 
(0022) 

In the figure, 15 indicates the multiplexing/separating section that 
multiplexes the voice signal from the voice encoding/decoding section 5, 
image signal from the video signal encoding/decoding section 10, the data 
from the data interface section 12, and the BAS from the system controller 
14 into the transmission frame unit, and that separates them into each 
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medium in a reception frame unit to inform them to each section, according 
to the CCITT proposal H. 221. In the same figure, 16 indicates the circuit 
interface section for controlling the circuit according to the ISDN user 
network interface. 
(0023) 

Fig. 3 indicates a flowchart of the operation of the embodiment 
example. In the figure, the steps in S2 - S6 are called " an automatic 
display section-tracking operation." The automatic display section- tracking 
operation is automatically activated at the start of the communication and 
can be activated during the communication whenever the operation unit 1 3 
commands it. 
(0024) 

First, the circuit interface section 16 controls the transmission and 
reception and, thus, the communication starts (SI). Then, the system 
controller 14 commands, via the voice interface section 4, the voice sample 
collection to the microphones 1 and 2. The microphones I and 2 begin to 
collect the voice samples (S2). The collected voice samples are transmitted 
to the system controller 14 via the voice interface section 4. The system 
controller 14 compares the voice samples transmitted from the microphones 
1 and 2, respectively, and determines which sound volume is higher (S3). 

1 1 


Based on the result of comparison, the system controller 14 determines the 
direction of the target, i.e., the speaker (S4), and commands the video 
interface section 9 to direct the movable display unit 8 to the target (S5). 
The video interface section 9 directs the movable display unit 8 to the 
commanded direction (S6). 
(0025) 

Hereinafter, the automatic display unit-tracking operation S2 - S6 is 
repeated until the communication is completed (Yes in S8) if the operation 
section 13 commands the automatic display unit-tracking operation. 
(0026) 

(Embodiment Example 2) 

In the aforementioned embodiment example 1 , two microphones were 
used for the voice sample collecting means. 
(0027) 

By contrast, in this example, one movable microphone is used for the 
voice sample collecting means, and when the voice samples are collected, 
the system controller 14 moves/controls the movable microphone to collect 
the voice samples at several positions. 


(0028) 

(Embodiment Example 3) 

Fig. 4 shows an oblique view of the "image/voice communication 
terminal device as embodiment example 3. In the figure, 1 and 2 indicate 
the microphones, and 6 - 1 the movable video camera. 
(0029) 

Fig. 5 shows a block diagram of the structure of the embodiment 
example 3. In the figure, blocks other than 6-1,8-1,9-1,14-1, 
indicate the same blocks as those in Fig. 1, so they are omitted from the 
explanation. 
(0030) 

In the figure, 6- 1 indicates an image input means and a movable 
video camera for inputting a self-image; 8-1 indicates the display unit for 
displaying the image input from the camera 6-1 or picture image camera 7, 
received image from the other party, and operation screen. 
(0031) 

In the figure, 9 - 1 indicates the video interface section that performs, 
by the command from the system controller 14 - 1, the switching operation 
of image input means, the switching operation of input images, the 
reception images, and of operation screen displays, the image signal 
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synthesizing operation for dividedly displaying them on the display section 
8-1, and the camera control operation for controlling the direction of 
movable video camera 6. 
(0032) 

In the figure, 14-1 indicates the system controller which is equipped 
with a CPU, ROM, RAM, and with an auxiliary memory device, and which 
creates the operation/display screen according to the control status of the 
entire device by monitoring the status of each section, executes the 
application program, command the voice sample collection to the voice 
interface section 4, compares the voice data of the voice samples transmitted 
from the voice interface section, and provides the video interface section 9 - 
1 with the direction of the target determined by the result of the comparison. 
(0033) 

Fig. 3 shows a flowchart indicating the operation of the embodiment 
example. In the figure, the operations in the steps S 1 2 - S 1 6 are put 
together and called "an automatic camera-tracking operation." The 
automatic camera-tracking operation is automatically activated at a time of 
starting the communication, and can be activated during the communication 
whenever the operation unit 13 commands it. 
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(0034) 

First, the circuit interface section 16 controls the transmission or 
reception and, thus, the communication starts (SI 1). The, the system 
controller 14-1 commands, via the voice interface section 4, the voice 
sample collection to the microphones 1 and 2. Here, the microphones 1 and 
2 begin to collect the voice samples around the terminal (SI 2). The 
collected voice samples are transferred to the system controller 14 - 1 via 
the voice interface section 4. The system controller 14-1 compares the 
voice samples transmitted from the microphones 1 and 2, respectively to 
determine which sound volume is higher (SI 3). Based on the result of 
comparison, the system controller 14-1 determines the direction of the 
target, i. e., the speaker (SI 4), and commands the video interface section 9 
- 1 to direct the movable video camera 6-1 to the direction of the target 
(SI 5). The video interface section 0-1 directs the movable video camera 6-1 
to the commanded direction. 
(0035) 

Hereinafter, the automatic camera-tracking operations S12 - S16 are 
repeated until the communication ends if the operation section 13 
commands the automatic camera-tracking operation (Yes in. SI 7). 
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(0036) 

(Embodiment Example 4) 

In the aforementioned Embodiment Example 3, two microphones 
were used for the voice sample collection means. 
(0037) 

In this example, one movable microphone is used for the voice 
sample collection means. When the voice samples are collected, the system 
controller 14-1 moves/controls the movable microphone to collect the 
voice sample at several positions. 
(0038) 

In the aforementioned each embodiment example, the display unit or 
video camera is directed to the user, but the present invention is not limited 
to this method; the display unit and video camera can both be directed to 
the user. Also, the direction can be changed by rotating the mounting unit, 
or by changing the reflection direction by using a mirror. 
(0039) 
(Advantage) 

As explained above, according to the present invention, the image 
display means and video camera automatically track down the user, the user 
needs not be conscious of the visible range of the display means or the 
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photographing range of the video camera, which allows comfortable 
communications. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Fig. 1 shows an oblique view of the contour of the embodiment 
example 1. Fig. 2 shows a block diagram of the embodiment example 1 . 
Fig. 3 shows a flowchart of the embodiment example 1 . Fig. 4 shows an 
oblique view of the embodiment example 3. Fig. 5 shows a block diagram 
of the embodiment example 3. Fig. 6 shows a flowchart of the embodiment 
example 3. 

1,2. microphones 

8. movable display unit 

9. video interface section 
14. system controller 
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[0031] 9 - 1 {4. i^X^mmU 14-1 (TiWrs 

tio, mmxfi^mmmi>m. xiimmt^m 

9,-i±.x"i}mm.^>-th-rdsb<m%.m^'^mm. 

^mm^'^r)>^f^^y'!'yx.~xmx'hh> 

[0032] 14-1{4, CPU, ROM, RAM, ffl 

^tfyr/i/t&m^otgs^, ^^^y:?7x-xaJ4*>'olE 
m^fiX%tz^P^yr)Vff)^Pr—S>i)iiW.L. fcT) 
i 0 n/i ^^<r>-}5m t'r ^ y 7 x - 
xg|59 - 1 tct§^-tl.^<7)3in.ag:tT-5 v-xr A0Jffllgi5t' 
$>■?). 

[ 0 0 3 3 ] HSJi, *llififi?iJ<7)Kif^S-^-r70-^A' 
-Nt'S>l.. |5liatcfci'^-C, S12~S16*-C'(0t!jf^ 

1 3t^h<r)^.x\ mm'^^'kisbxmi^mm^bti^ 
x'th. 

[0 0 34] ti^m^^y^y^-xuieif. ^m^b 
h\-^\m^m\m'^^\ m.mifimm-h ( s 1 1 ) . 

tht'yy^TJ-.V\mui A - lif. ^P^y^yx-x 
gB4S:jitT-7^^o*>'i , 2i,z^P^yr)Ui^m<r) 
jg^5-ajL, -7-^ i?o*yi . 2im^mm<r)'^p-^y 
r/u<7)i^m^mi^-ti ( s 1 2 ) . i^m^tifz^p-fy 
r/Ksw^-^ yi'y x-xgB4 ix i^xf-i^mmu 

14-1 tc!!giS$ix. '>XTJ^Mm3 14-1 14v-f ^' 
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at^yi. 2i]^(>^ti^'jxm(^tix^fz^p^yr)Ui: 
mi, t'hh!7)^mA^±^^^:>)-m&th ( s 1 3 ) . 

^IX^cO^M^Zi^-^X, i^XTA«i)Wa514-li4. 

mt^d:hhBmco:ffm^^l ( s 1 4 > . ^<7):ff\^ 
^■BJilj^t'r^;!^^ 5 6-1 i\^l,fhXd th'r^t^ V 
^7x-XgP9-l^i^S:ai-t (SI 5) . t'T:t^ 
y ^ 7 x-xgil9 - 1 (ii^$ix/i:Sri6]^^t!)5^tT^ 
;*^76- lS:|6]ltl. (S 1 6) . 

[0035] am. mmmTtx' ( s 1 s, ye s ) , 
10 1 3 7 iij3isi)f^<7)t^*ij)ix{f ( s 
17. YES) , A^^mmmm^^si2^sie^ 

[0036] {mtm4 ) mimmmsxu. 

[0037] zmzM i^^mmmxu, zco^^p^yr 

tUi^m^m LX. ■BIf)^<7)v-^ ^n;j-.ySr lofflv\ 
^p-fyrjU^mmzii. S^^r A$iJM 1 4 - 1 

ff)^W}^-? A^ut^y mmm^ h^uzxn.m^ 

20 [0038] ti:ii\^J±ff)^mm\i, mr.W>, b-T:t;& 

^ y(7)\^-ftii)^i^Wi^<r>Mz^nh ij<r>X'httK 
if%m\i:iix^zm^-f , t'T7r;i7^7ft{cffiffl 

^co-^^^fizmmx-mmthzti^x'^h. ttz-n^ 
i6] ^sf 5 1 V > -5 mx-mmt ^zthx-^^. 

[0039] 

[HI] *lfipill<^^'h{E#f*Ill 
[02] ^SfiC?l|l<7)7'P'y^ll 
[03] ||5fe0i|l<O7o-^Y-h 

[04] mm3<n^vmnm 

[05] ||}g01j3tfOyo.y:70 
[06] llifiW3i07O— ?-r-h 

40 1,2 v-^^'o*y 

8 nr»«s^f5 

9 h'r:t^y:5'7x-XgS 
14 xXri^^iJIJPS 
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